data-centric artificial intelligence
Data-centric Artificial Intelligence: A Survey
Zha, Daochen, Bhat, Zaid Pervaiz, Lai, Kwei-Herng, Yang, Fan, Jiang, Zhimeng, Zhong, Shaochen, Hu, Xia
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler of its great success is the availability of abundant and high-quality data for building machine learning models. Recently, the role of data in AI has been significantly magnified, giving rise to the emerging concept of data-centric AI. The attention of researchers and practitioners has gradually shifted from advancing model design to enhancing the quality and quantity of the data. In this survey, we discuss the necessity of data-centric AI, followed by a holistic view of three general data-centric goals (training data development, inference data development, and data maintenance) and the representative methods. We also organize the existing literature from automation and collaboration perspectives, discuss the challenges, and tabulate the benchmarks for various tasks. We believe this is the first comprehensive survey that provides a global view of a spectrum of tasks across various stages of the data lifecycle. We hope it can help the readers efficiently grasp a broad picture of this field, and equip them with the techniques and further research ideas to systematically engineer data for building AI systems. A companion list of data-centric AI resources will be regularly updated on https://github.com/daochenzha/data-centric-AI
- North America > United States > Florida > Hillsborough County > University (0.05)
- North America > United States > Texas > Brazos County > College Station (0.04)
- Europe > United Kingdom > England > Leicestershire > Leicester (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report (1.00)
- Overview (1.00)
Why it's time for "data-centric artificial intelligence"
The last 10 years have brought tremendous growth in artificial intelligence. Consumer internet companies have gathered vast amounts of data, which has been used to train powerful machine learning programs. Machine learning algorithms are widely available for many commercial applications, and some are open source. Now it's time to focus on the data that fuels these systems, according to AI pioneer Andrew Ng, SM '98, the founder of the Google Brain research lab, co-founder of Coursera, and former chief scientist at Baidu. Ng advocates for "data-centric AI," which he describes as "the discipline of systematically engineering the data needed to build a successful AI system."
- Education (0.92)
- Health & Medicine (0.80)
- Information Technology (0.57)
From Model-centric to Data-centric Artificial Intelligence
Two basic components of all AI systems are Data and Model, both go hand in hand in producing desired results. In this article we talk about how the AI community has been biased towards putting more effort in the model, and see how it is not always the best approach. We all know that machine learning is an iterative process, because machine learning is largely an empirical science. You do not jump to the final solution by thinking about the problem, because you can no easily articulate what the solution should look like. Hence you empirically move towards better solutions.
Landing AI: Unlocking The Power Of Data-Centric Artificial Intelligence
Artificial intelligence (AI) has been hugely transformative in industries with access to huge datasets and trained algorithms to analyze and interpret them. Probably the most obvious examples of this success can be found in consumer-facing internet businesses like Google, Amazon, Netflix, or Facebook. Over the last two decades, companies such as these have grown into some of the world's largest and most powerful corporations. In many ways, their growth can be put down to their exposure to the ever-growing volumes of data being churned out by our increasingly digitized society. But if AI is going to unlock the truly world-changing value that many believe it will – rather than simply making some very smart people in Silicon Valley very rich – then businesses in other industries have to consider different approaches.
Landing AI: Unlocking The Power Of Data-Centric Artificial Intelligence
Artificial Intelligence (AI) has been hugely transformative in industries with access to huge datasets and trained algorithms to analyze and interpret them. Probably the most obvious examples of this success can be found in consumer-facing internet businesses like Google, Amazon, Netflix, or Facebook. Over the last two decades, companies such as these have grown into some of the world's largest and most powerful corporations. In many ways, their growth can be put down to their exposure to the ever-growing volumes of data being churned out by our increasingly digitized society. But if AI is going to unlock the truly world-changing value that many believe it will – rather than simply making some very smart people in Silicon Valley very rich – then businesses in other industries have to consider different approaches.
- Information Technology > Services (0.56)
- Education > Educational Setting > Online (0.32)
Landing AI: Unlocking The Power Of Data-Centric Artificial Intelligence
Artificial intelligence (AI) has been hugely transformative in industries with access to huge datasets and trained algorithms to analyze and interpret them. Probably the most obvious examples of this success can be found in consumer-facing internet businesses like Google, Amazon, Netflix, or Facebook. Over the last two decades, companies such as these have grown into some of the world's largest and most powerful corporations. In many ways, their growth can be put down to their exposure to the ever-growing volumes of data being churned out by our increasingly digitized society. But if AI is going to unlock the truly world-changing value that many believe it will – rather than simply making some very smart people in Silicon Valley very rich – then businesses in other industries have to consider different approaches.